Digital Curation as a Key Component in Research Infrastructures: From Data Preservation to Processes Preservation and Verification
نویسنده
چکیده
With the advent of data-driven science, also referred to as, for example, the Fourth Paradigm, Big Data, and other similar concepts, the need to safeguard the investments made into collecting and preparing massive amounts of data (some of which is unrecoverable) has drastically gained importance. Providing digital preservation of research data is thus emerging as a service that has to be provided by sophisticated research infrastructure frameworks. Yet, with the complexity of research processes increasing, the needs for preservation stretch beyond merely maintaining data accessible. Capturing and documenting the context of its creation and use is an enormous task, requiring sophisticated representation information networks. Even more challenging, complex processes are an integral part of data provenance. We thus also need to capture, preserve, and maintain usable a series of data processing routines and modules in order to be able to establish the validity of scientific analysis, to repeat earlier computations on new data, in short to make full use of the opportunities offered by data-intensive science. This tutorial will start with a brief review of the classical challenges in digital preservation. It will then move on to motivate the need for process preservation as part of data curation. This will be followed by a presentation of approaches to facilitate process preservation, most notably process context capture as well as recommendations on how to ease process preservation by proper design.
منابع مشابه
Study of the foundation, models and issues of research data curation and management in scientific and academic environments
Background and Aim: The purpose of this paper is to study, identifying and discuss the foundation and concepts, models and frameworks, dimensions and challenges of research data curation and management in scientific and academic environments. Method: This article is a review article and library method was used to collect scientific and research texts in this field. In this research, external an...
متن کاملProcess Management Plans
In the era of research infrastructures and big data, sophisticated data management practices are becoming essential building blocks of successful science. Most practices follow a data-centric approach, which does not take into account the processes that created, analysed and presented the data. This fact limits the possibilities for reliable verification of results. Furthermore, it does not gua...
متن کاملView From Across the Pond: Opportunities, Gaps, and Challenges in Digital Curation Lifelong Learning
While some excellent lifelong learning programs in digital curation and preservation for cultural heritage information professionals exist in the US, most activities are sporadic and depend on temporary and shrinking grant funding. How best to provide continuing education on digital curation and preservation remains an open question. This paper will critique current programs, discuss key issues...
متن کاملChronopolis Digital Preservation Network
The Chronopolis Digital Preservation Initiative, one of the Library of Congress’ latest efforts to collect and preserve at-risk digital information, has completed its first year of service as a multimember partnership to meet the archival needs of a wide range of domains. Chronopolis is a digital preservation data grid framework developed by the San Diego Supercomputer Center (SDSC) at UC San D...
متن کاملToward Distributed Infrastructures for Digital Preservation: The Roles of Collaboration and Trust
This paper first explores some of the reasons why collaboration is becoming increasingly important in supporting scientific data curation, digital preservation initiatives and institutional repository development. It then investigates the concepts of trust and control used in the organisation science literature and attempts to apply them to the work on trustworthy repositories being carried out...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012